MetricHunter: A software metric dataset generator utilizing SourceMonitor upon public GitHub repositories
Authors
Abstract
Version control systems are pervasively consulted nowadays to obtain software metric datasets. Accordingly, machine learning is applied to predict different aspects of a software, including quality monitoring, influence analysis, etc. However, the construction of a dataset is challenging, and the content of the dataset may affect the success of learning-based models. In this study, we propose a tool, MetricHunter, which is able to produce platform/language-specific datasets that can be used for predicting the features of newly created software. The proposed tool is developed in the C# programming language, utilizing a known software metric gathering tool, i.e. SourceMonitor, and the GitHub REST API on public GitHub repositories. Thus, one can construct a proper dataset from the graphical user interface by simply specifying the target programming language or the target platform. The outputs of the tool on a set of public repositories are validated by investigating the automatically generated attribute values and comparing them with the measurements of tools as well as values.
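As a rough illustration of the repository-selection step described in the abstract, the sketch below builds a query against the GitHub REST API repository search endpoint, filtered by programming language. It is not the authors' implementation (the tool itself is written in C#); the function names `build_search_url` and `fetch_repositories`, the sort order, and the page size are assumptions made for this example only.

```python
import json
import urllib.parse
import urllib.request

# Public GitHub REST API endpoint for repository search.
GITHUB_SEARCH_URL = "https://api.github.com/search/repositories"


def build_search_url(language, per_page=10):
    """Build a repository-search URL filtered by programming language.

    Hypothetical helper: sorting by stars is an assumption for this sketch.
    """
    params = {
        "q": f"language:{language}",
        "sort": "stars",
        "order": "desc",
        "per_page": per_page,
    }
    return f"{GITHUB_SEARCH_URL}?{urllib.parse.urlencode(params)}"


def fetch_repositories(language, per_page=10):
    """Return (full_name, clone_url) pairs for matching public repositories.

    Requires network access; unauthenticated requests are rate limited.
    """
    request = urllib.request.Request(
        build_search_url(language, per_page),
        headers={"Accept": "application/vnd.github+json"},
    )
    with urllib.request.urlopen(request) as response:
        payload = json.load(response)
    return [(item["full_name"], item["clone_url"]) for item in payload["items"]]
```

Calling `fetch_repositories("python", per_page=5)` would return up to five repository names with their clone URLs; each repository could then be cloned locally and passed to a metric gathering tool such as SourceMonitor.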
Similar resources
PoDiGG: A Public Transport RDF Dataset Generator
A large amount of public transport data is made available by many different providers, which makes RDF a great method for integrating these datasets. Furthermore, this type of data provides a great source of information that combines both geospatial and temporal data. These aspects are currently undertested in RDF data management systems, because of the limited availability of realistic input d...
A Study of Scala Repositories on Github
Functional programming appears to be enjoying a renaissance of interest for developing practical, "real-world" applications. Proponents have long maintained that the functional style is a better way to modularize programs and reduce complexity. What is new in this paper is we test this claim by studying the complexity of open source codes written in Scala, a modern language that unifies functio...
Influence analysis of Github repositories
With the support of cloud computing techniques, social coding platforms have changed the style of software development. Github is now the most popular social coding platform and project hosting service. Software developers of various levels keep entering Github, and use Github to save their public and private software projects. The large amounts of software developers and software repositories ...
Unusual Events in GitHub Repositories
In large and active software projects, it becomes impractical for a developer to stay aware of all project activity. While it might not be necessary to know about each commit or issue, it is arguably important to know about the ones that are unusual. To investigate this hypothesis, we identified unusual events in 200 GitHub projects using a comprehensive list of ways in which an artifact can be...
Collaborative Topic Modeling for Recommending GitHub Repositories
The rise of distributed version control systems has led to a significant increase in the number of open source projects available online. As a consequence, finding relevant projects has become more difficult for programmers. Item recommendation provides a way to solve this problem. In this paper, we utilize a recently proposed algorithm that combines traditional collaborative filtering and prob...
Journal
Journal title: SoftwareX
Year: 2023
ISSN: 2352-7110
DOI: https://doi.org/10.1016/j.softx.2023.101499